|
|
|
|
|
|
Fast Measurement of Sugar in Fruits Using Near Infrared Spectroscopy Combined with Random Forest Algorithm |
LI Sheng-fang1,2, JIA Min-zhi1, DONG Da-ming2,3* |
1. Taiyuan University of Technology, Taiyuan 030024, China
2. Beijing Research Center for Intelligent Equipment for Agriculture, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China
3. National Engineering Technology Research Center for Agricultural Intelligent Equipment, Beijing 100097, China |
|
|
Abstract In recent years, many researchers have studied the measurement methods of fruit sugar and other internal quality by near-infrared (NIR) spectroscopy and some commercial instruments have been produced. However, due to the complexity of the NIR spectra, the transitivity of the models established with NIR is often poorly performed. The model is only built for a particular species or even a certain variety. Random forest (RF) is an integrated algorithm based on decision tree, which improves the prediction accuracy by integrating the classification regression tree (CART) model. Compared with partial least squares (PLS), multiple linear regression (MLR) and other methods, RF algorithm has the strong analytical ability of nonlinear data. Taking into account the randomness of the RF model, the model is optimized by debugging the number of decision tree (ntree) and the number of split variables (mtry). In this study, we used RF to predict the sugar content in different types of fruits (apple and pear). Experimental results showed that for the same kind of fruit, the modeling and predicting results of RF and PLS were better. However, for different types of fruits, RF significantly increased the prediction ability of the model. The R2 of PLS model was 0.878 and the R2 of RF model was increased to 0.999. The RMSEC of PLS model and RF model were respectively 0.453 and 0.015. In addition, the optimal RF model was tested by independent test set samples, the R2 of PLS model was 0.731 and the R2 of RF model was increased to 0.888. The RMSEC of PLS model and RF model were respectively 1.148 and 0.334. RF showed a significant advantage in predicting a variety of fruit sugar. This research proved that the RF method could be applied to detect the sugar content in fruits by NIR spectroscopy, thus solving the model problem of universality and transitivity.
|
Received: 2017-07-05
Accepted: 2017-11-14
|
|
Corresponding Authors:
DONG Da-ming
E-mail: damingdong@hotmail.com
|
|
[1] Choi J H, Chen P A, Lee B H N, et al. Scientia Horticulturae, 2017, 220: 147.
[2] Lee Y, Han S H. Bulletin of the Korean Chemical Society,2016,37(2):166.
[3] Nogales-Bueno J, Hernández-Hierro J M, Rodríguez-Pulido F J. Food Chemistry, 2014, 152: 586.
[4] Wu X, Wu B, Sun J, et al. Journal of Food Process Engineering, 2017,40(2).
[5] Toivonen P, Batista A, Lannard B, et al. Canadian Journal of Plant Science, 2017, 97(6):1030.
[6] Marques E J N, de Freitas S T, Pimentel M F, et al. Food Chemistry 2016, 197: 1207.
[7] DONG Jin-lei, GUO Wen-chuan(董金磊,郭文川). Optics Preci. Eng(光学精密工程), 2015,23(6):1530.
[8] LI Yan-xiao, HUANG Xiao-wei, ZOU Xiao-bo, et al(李艳肖, 黄晓玮, 邹小波, 等). Journal of Food Safety and Quality(食品安全质量检测学报), 2014, 5(6):1679.
[9] LI Mao-mao, ZHENG Xi-qun, REN Jian, et al(李毛毛, 郑喜群, 任 健, 等). Journal of Food Safety and Quality(食品安全质量检测学报),2015, 6(8): 3026.
[10] Xu H, Qi B, Sun T, et al. Journal of Food Engineering, 2012, 109 (1): 142.
[11] Li B, Wei Y, Duan H, et al. Vibrational Spectroscopy, 2012, 62: 72.
[12] MO Fei-fan, FAN Wei, ZHOU Ji-heng, et al(莫菲凡, 范 伟, 周冀衡, 等). Journal of Food Safety and Quality(食品安全质量检测学报), 2014, 5(8): 2430.
[13] CHEN Hua-zhou, CHEN Fu, SHI Kai(陈华舟, 陈 福, 石 凯). Transactions of the Chinese Society of Agricultural Machinery(农业机械学报), 2015, 46(5): 233.
[14] Olarewaju, Olaoluwa Omoniyi, Isa Bertling, et al. Scientia Horticulturae 2016, 199: 229. |
[1] |
GAO Feng1, 2, XING Ya-ge3, 4, LUO Hua-ping1, 2, ZHANG Yuan-hua3, 4, GUO Ling3, 4*. Nondestructive Identification of Apricot Varieties Based on Visible/Near Infrared Spectroscopy and Chemometrics Methods[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 44-51. |
[2] |
LIU Jia, ZHENG Ya-long, WANG Cheng-bo, YIN Zuo-wei*, PAN Shao-kui. Spectra Characterization of Diaspore-Sapphire From Hotan, Xinjiang[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 176-180. |
[3] |
BAO Hao1, 2,ZHANG Yan1, 2*. Research on Spectral Feature Band Selection Model Based on Improved Harris Hawk Optimization Algorithm[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 148-157. |
[4] |
HE Qing-yuan1, 2, REN Yi1, 2, LIU Jing-hua1, 2, LIU Li1, 2, YANG Hao1, 2, LI Zheng-peng1, 2, ZHAN Qiu-wen1, 2*. Study on Rapid Determination of Qualities of Alfalfa Hay Based on NIRS[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3753-3757. |
[5] |
HU Cai-ping1, HE Cheng-yu2, KONG Li-wei3, ZHU You-you3*, WU Bin4, ZHOU Hao-xiang3, SUN Jun2. Identification of Tea Based on Near-Infrared Spectra and Fuzzy Linear Discriminant QR Analysis[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3802-3805. |
[6] |
LIU Xin-peng1, SUN Xiang-hong2, QIN Yu-hua1*, ZHANG Min1, GONG Hui-li3. Research on t-SNE Similarity Measurement Method Based on Wasserstein Divergence[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3806-3812. |
[7] |
LU Wen-jing, FANG Ya-ping, LIN Tai-feng, WANG Hui-qin, ZHENG Da-wei, ZHANG Ping*. Rapid Identification of the Raman Phenotypes of Breast Cancer Cell
Derived Exosomes and the Relationship With Maternal Cells[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3840-3846. |
[8] |
BAI Xue-bing1, 2, SONG Chang-ze1, ZHANG Qian-wei1, DAI Bin-xiu1, JIN Guo-jie1, 2, LIU Wen-zheng1, TAO Yong-sheng1, 2*. Rapid and Nndestructive Dagnosis Mthod for Posphate Dficiency in “Cabernet Sauvignon” Gape Laves by Vis/NIR Sectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3719-3725. |
[9] |
WANG Qi-biao1, HE Yu-kai1, LUO Yu-shi1, WANG Shu-jun1, XIE Bo2, DENG Chao2*, LIU Yong3, TUO Xian-guo3. Study on Analysis Method of Distiller's Grains Acidity Based on
Convolutional Neural Network and Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3726-3731. |
[10] |
LUO Li, WANG Jing-yi, XU Zhao-jun, NA Bin*. Geographic Origin Discrimination of Wood Using NIR Spectroscopy
Combined With Machine Learning Techniques[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3372-3379. |
[11] |
ZHANG Shu-fang1, LEI Lei2, LEI Shun-xin2, TAN Xue-cai1, LIU Shao-gang1, YAN Jun1*. Traceability of Geographical Origin of Jasmine Based on Near
Infrared Diffuse Reflectance Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3389-3395. |
[12] |
YANG Qun1, 2, LING Qi-han1, WEI Yong1, NING Qiang1, 2, KONG Fa-ming1, ZHOU Yi-fan1, 2, ZHANG Hai-lin1, WANG Jie1, 2*. Non-Destructive Monitoring Model of Functional Nitrogen Content in
Citrus Leaves Based on Visible-Near Infrared Spectroscopy[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3396-3403. |
[13] |
HUANG Meng-qiang1, KUANG Wen-jian2, 3*, LIU Xiang1, HE Liang4. Quantitative Analysis of Cotton/Polyester/Wool Blended Fiber Content by Near-Infrared Spectroscopy Based on 1D-CNN[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3565-3570. |
[14] |
YAN Xing-guang, LI Jing*, YAN Xiao-xiao, MA Tian-yue, SU Yi-ting, SHAO Jia-hao, ZHANG Rui. A Rapid Method for Stripe Chromatic Aberration Correction in
Landsat Images[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3483-3491. |
[15] |
HUANG Zhao-di1, CHEN Zai-liang2, WANG Chen3, TIAN Peng2, ZHANG Hai-liang2, XIE Chao-yong2*, LIU Xue-mei4*. Comparing Different Multivariate Calibration Methods Analyses for Measurement of Soil Properties Using Visible and Short Wave-Near
Infrared Spectroscopy Combined With Machine Learning Algorithms[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3535-3540. |
|
|
|
|